# Low VRAM optimization
Qwen2.5 Omni 7B AWQ
Other
Qwen2.5-Omni is an end-to-end multimodal model capable of perceiving multiple modalities including text, images, audio, and video, while generating text and natural speech responses in a streaming manner.
Multimodal Fusion
Transformers English

Q
Qwen
77
8
LTX Video 0.9.7 Dev
Other
The first real-time high-quality video generation model based on DiT architecture, capable of generating 1216×704 resolution videos at 30fps
Video Processing English
L
Lightricks
477
7
Auraflow DomoKun LoRA Rank8
Apache-2.0
A standard PEFT LoRA model trained on fal/AuraFlow, specializing in text-to-image and image-to-image generation tasks for Domo-kun characters.
Image Generation
A
bghira
532
0
FLUX Hyperscale Fused
Other
FLUX is a text-to-image generation model that integrates 5 high-quality fine-tuned adapters, capable of producing images in various styles
Image Generation English
F
minpeter
131
2
Wan2.1 Fun 1.3B Control
Apache-2.0
Wan2.1-Fun-1.3B is a text-to-video generation model that supports multi-resolution training and first/last frame prediction.
Text-to-Video Supports Multiple Languages
W
alibaba-pai
22.19k
97
Origami WanLora
Apache-2.0
This is a LoRA adapter based on the Wan2.1-T2V-14B model, designed for generating origami-style videos.
Text-to-Video English
O
shauray
50
2
Phi3 Uncensored Chat
MIT
A fine-tuned version based on microsoft/phi-3-mini-4k-instruct, specifically designed for role-playing dialogues with various characters
Large Language Model
Transformers English

P
luvGPT
77
6
Wan2.1 Fun 1.3B InP
Apache-2.0
Wan2.1-Fun-1.3B is a text-to-video generation model developed by Alibaba PAI team, supporting multi-resolution training and first/last frame prediction.
Text-to-Video Supports Multiple Languages
W
alibaba-pai
6,753
25
Steamboat Willie 1.3b
A LoRA model trained on Steamboat Willie animation clips for generating text-to-video content in golden age animation style
Text-to-Video
S
benjamin-paine
90
3
Cogview4 6B
Apache-2.0
CogView4-6B is a text-to-image model based on the GLM-4-9B foundation model, supporting both Chinese and English, capable of generating high-quality images.
Text-to-Image Supports Multiple Languages
C
THUDM
333.85k
216
Deepseek R1 AWQ
MIT
AWQ quantized version of DeepSeek R1 model, optimized for float16 overflow issues and supports efficient inference deployment
Large Language Model
Transformers Supports Multiple Languages

D
cognitivecomputations
30.46k
77
Omnigen V1 Bnb 8bit
MIT
The 8-bit quantized version of OmniGen-v1, suitable for text-to-image and image-to-image tasks, supporting multimodal input.
Text-to-Image
O
gryan
76
0
Stable Diffusion V3 5 Large GGUF
Other
Stable Diffusion 3.5 Large Model is a multimodal diffusion transformer (MMDiT) text-to-image model, with significant improvements in image quality, text layout, complex prompt understanding, and resource efficiency.
Text-to-Image English
S
gpustack
13.33k
7
Flux Actors Face Inset Cig Cards LoKr
Other
A LyCORIS adapter based on FLUX.1-dev, specializing in text-to-image generation tasks, particularly suitable for work environments.
Image Generation
F
davidrd123
20
1
Flux Fusion V2 4step Merge Gguf Nf4
Other
A text-to-image model formed by merging Schnell, fine-tuned Dev, and Hyper, recommended steps 4-8, with significant quality improvement at 4 steps
Text-to-Image English
F
Anibaaal
1,212
10
Neuraldaredevil 8B Abliterated GGUF
Other
This is a quantized version of the NeuralDaredevil-8B-abliterated model, providing model files of various quantization types, suitable for users with different hardware conditions and requirements.
Large Language Model
N
bartowski
577
11
ALMA 7B Ja V2
ALMA-7B-Ja-V2 is a machine translation model supporting Japanese-English bidirectional translation, with improved performance through additional training on the previous version.
Machine Translation
Transformers Supports Multiple Languages

A
webbigdata
118
18
Etherrealmix 1
Other
Ether Real Mix is a text-to-image generation model based on stable diffusion technology, focusing on generating high-quality, artistic-style images.
Image Generation
E
digiplay
53
2
Rwkv Raven 14b
RWKV is a large language model combining the advantages of RNN and Transformer, supporting efficient training and fast inference with unlimited context processing capability.
Large Language Model
Transformers

R
RWKV
271
57
Cool Japan Diffusion 2 1 2
Other
An anime-style text-to-image model fine-tuned on Stable Diffusion, specializing in 'Cool Japan' cultural content
Image Generation
C
aipicasso
57
15
Genji Python 6B Split
Apache-2.0
GPT-J 6B fine-tuned model for Python code generation, specialized in Python programming assistance
Large Language Model
Transformers English

G
baffo32
16
0
Gpt J 6B Vietnamese News
This is a 6B-parameter Vietnamese causal language model based on GPT-J architecture, specifically trained for Vietnamese news content.
Large Language Model
Transformers Other

G
VietAI
105
12
Featured Recommended AI Models